Goto

Collaborating Authors

 oxel mamba



Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection

Neural Information Processing Systems

Serialization-based methods, which serialize the 3D voxels and group them into multiple sequences before inputting to Transformers, have demonstrated their effectiveness in 3D object detection. However, serializing 3D voxels into 1D sequences will inevitably sacrifice the voxel spatial proximity. Such an issue is hard to be addressed by enlarging the group size with existing serialization-based methods due to the quadratic complexity of Transformers with feature sizes.